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SYSTEM, METHOD AND PROGRAM FOR 
PRINTING AN ELECTRONIC DOCUMENT 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0001] This invention relates to a technology for printing processing in which an electronic 
document is downloaded via a network such as the Internet and printed as a booklet or a book. 
2. Description of the Related Art 

[0002] To download and print an electronic document stored on a network, hardware is required 
to enable a continuous connection or dialup connection to the Internet. An electronic document 
that is stored as a file such as a HyperText Markup Language (HTML) file may include links to 
other files. Accordingly, printing a page of the electronic document may require downloading the 
linked files after downloading the main file. 

[0003] If the files forming the electronic document are temporarily stored in a local area of a 
computer before printing, a user needs to modify the absolute description of the links in the main 
file to the location of the files in the local area. Further, printing an electronic document 
displayed by a Web viewer directly causes some problems: printing navigation icons such as a 
previous-page button icon, a next-page button icon, a top-page button icon, and navigation 
banner, which are necessary for browsing but not necessary in the printed material, reduces the 
density of information on the printed page. 

[0004] In order to shorten the time required to print an electronic document described in HTML, 
Japanese Unexamined Patent Application Publication No. 11-25156 describes a method in 
which a system downloads the files forming the Web page, automatically embeds the description 
of the linked file (the linked page) at the point of the link anchor of the main file (the main page), 
and prints the new edited file. According to the method disclosed in JP 11-25156, the 
inconvenience of downloading the linked page again and correcting the description of the linked 
files in the original page manually may be eliminated. The conventional method may also 
reduce the blank space on each page to some degree. However, in JP 11-25156, the 
description of the linked page is simply inserted at the position in the original main HTML file and 
the processed main page is simply and continuously printed. Therefore, according to the 
method disclosed in JP 11-25156, printing or processing Web pages that do not supply a printed 
material having the proper style of a booklet or book may be difficult. 
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[0005] It Is one object of the present invention to provide a system, a method and a computer 
program for printing such a downloaded electronic document, for example a online help file or an 
online manual file, promised on viewing by a Web viewer onto sheet material to obtain a real 
printed material having the proper style of a booklet or a book. 

[0006] Specifically, it is the object of the present invention to provide a system, a method and a 
computer program for printing a downloaded electronic document as a real printed material 
having a front cover printed such the title of the document, the table of contents and so on; or a 
header, a footer, or both of them printed the title of a chapter printed on pages. 
[0007] Further, it is another object of the present invention to provide a system, a method and a 
computer program for printing any part selected arbitrarily by a user from a downloaded 
electronic document as a real printed material having the proper style of a booklet or a book. 
[0008] In order to accomplish the above objects, according to one aspect of the present 
invention, there is provided a system for printing processing an electronic document described in 
a markup language comprising analysis defining information storing means for storing analysis 
defining information for analysis which is determined based on a description rule and a template 
used for describing an electronic document in the markup language; cover file generating means 
for extracting cover information displayed on a cover of the electronic document based on tag 
information on a tag and a directive described in the document file and for generating a cover file 
of the electronic document; file analyzing and converting means for analyzing the description of 
the document file based on the analysis defining information, for deleting information 
unnecessary for printing of the document from the document file, for analyzing a document 
structure of the electronic document, for dividing the document file on item unit generating the 
document structure in accordance with the analyzed document structure, and for generating a 
converted file; and printing processing means for printing a document with the cover based on 
the cover file and the converted file in accordance with a request for printing the electronic 
document. 

[0009] According to the above-mentioned invention, the system accesses an electronic 
document server for managing an electronic document via a network such as the Internet, and 
obtains document files in a markup language (e.g., HTML) forming one electronic document. 
Further, the system previously stores analysis defining information for analysis which is 
determined based on a description rule and a template used for describing the electronic 
document in the markup language. Then, the system extracts cover information for displayed on 
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a cover of the electronic document based on tag information on a tag and a directive described 
in the document file, and generates a cover file of the electronic document. Then, the system 
analyzes the description of the document file based on the analysis defining information, and 
deletes information unnecessary for printing of the document from the document file. Then, a 
system analyzes a document structure of the electronic document, divides the document file on 
item unit forming the document structure in accordance with the analyzed document structure, 
and generates a converted file. After that, the system prints a document with the cover based on 
the cover file and the converted file in accordance with a request for printing the electronic 
document. 

[0010] In addition, according to the present invention, the system, upon receiving a request for 
printing-with-cover indicating to print the electronic document with the cover, prints the document 
with the cover printed such cover information like the title of document, the table of contents, a 
copyrights notice based on the cover file and the converted. On the other hand, on receiving a 
request for printing-without-cover indicating to print the electronic document without the cover 
from a user, the system prints the documents with the cover information to a header part, a 
footer part, or both the header and the footer parts. 

[0011] Further, according to the present invention, the system generates contents structure 
information (structured-contents structure) for generating a table of contents of the electronic 
document and information (file-corresponding information) having a corresponding relationship 
between items of the table of contents and the converted file based on the analyzed document 
structure, and displays the table of contents in which the display or not-display of classes in the 
document structure is dynamically controlled, based on the content structure information and the 
corresponding information, receives the item selected for printing from the table of contents, and 
prints the document including the selected item with the cover on the basis of the converted file 
corresponding to the selected item in accordance with a request of the electronic document. 
[0012] As mentioned above, according to the present invention, it is possible to print the 
downloaded electronic document as a real document having an appearance of a booklet or a 
book, irrespective of the connection with the Internet of the computer terminal. In particular, it is 
possible to print the electronic document having the front cover printed necessary items such as 
a title of a document, a document code, and a copyright notice. 

[0013] Further, according to the present invention, it is possible to print the electronic document 
having necessary items such as a title of a chapter or section and a copyright notice printed in a 
header part, a footer part, or both of them on each page. 

[0014] Furthermore, according to the present invention, it is possible to display, on a screen of a 
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computer, a table of structured contents generated from the files forming an electronic document 
and to print only the items selected arbitrarily by a user while viewing the table of structured 
contents and some contents of the electronic document. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0015] FIG. 1 is a block diagram showing an example of a configuration of an electronic 
document printing system according to the present invention; 

FIG. 2 is a diagram showing an example of a display screen of a page viewer for 
displaying a page as a printing target in the electronic document; 

FIG. 3 to FIG. 5 are diagrams showing parts of source of an HTML file of a body text page 
as shown in FIG. 2; 

FIG. 6 to FIG. 8 are diagrams showing a source of a converted file; 

FIG. 9 is a diagram showing an example of a display screen of a table of structured 
contents and a page of the converted file; 

FIG. 10 is a diagram showing images of a real document as a printing result; 

FIG. 11 is a flowchart for file obtaining processing; 

FIG. 12 to FIG 14 are diagrams showing parts of a source of the downloaded file 
"index.html" so as to describe the file obtaining processing; 

FIG 15 is a flowchart for file analyzing and converting processing; 

FIG 16 is a diagram showing a part of a source of the HTML file so as to describe the 
downloading operation in data analyzing and converting processing; 

FIG. 17 is a diagram showing examples of tags as an analyzed target; 

FIG. 18 is a diagram showing examples of tags added in the conversion; 

FIG. 19 is a diagram showing examples of items (modules) forming a table of structured 
contents structure; 

FIG. 20 is a flowchart for structured-contents display processing; 

FIG. 21 is a diagram showing examples of display icons; 

FIG. 22 is a flowchart for printing processing; 

FIG. 23 is a flowchart for cover forming processing; 

FIG. 24 is a flowchart for preview processing; and 

FIG. 25 is a diagram showing examples of functions (modules) used for a computer 
program realizing the electronic document printing system. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0016] FIG. 1 is a diagram showing an example of an electronic document printing system 
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according to the present invention. An electronic document printing system 1 connects to an 
electronic document server 3 via a network 2. The electronic document server 3 manages an 
electronic document 4 premised on being displayed by a Web viewer and let the electronic 
document 4 downloads based on a user's request. The electronic document 4 is an HTML file 
described in HTML. The electronic document printing system 1 has a file obtaining unit 10, a file 
analyzing and converting unit 20, and a printing processing unit 40. 

[0017] The file obtaining unit 10 downloads the electronic document 4 via the network 2 from 
the electronic document server 3. The file obtaining unit 10 has a downloaded-file determining 
unit 11 and a downloading unit 12. The downloaded-file determining unit 11 determines a 
plurality of files forming one electronic document 4 as a target for printing based on a URL 
(Uniform Resource Locator) assigned by for example a user. The downloading unit 12 
downloads the files determined as a target for downloading by the downloaded-file determining 
unit 11 from the URL of the files forming the electronic document 4. Further, the downloading 
unit 12 stores the downloaded files in the electronic document printing system 1. The 
downloaded files of the electronic document 4 are assumed as an HTML files 51 . 
[0018] The file analyzing and converting unit 20 analyzes the description of each file of the 
HTML files 51 downloaded by the file obtaining unit 10 on the basis of analysis defining 
information 50. Then, the file analyzing and converting unit 20 converts the HTML files 51 to a 
converted file 53 the description of which is converted to be suitable for printing onto sheet 
material and forms a cover file 52 based on the HTML files 51 . The file analyzing and converting 
unit 20 has a cover-file generating unit 21 , an analyzing and converting unit 22, and a contents- 
structure generating unit 23. 

[0019] The analysis defining information 50 is information for the analysis of tags as targets in 
the analyzing processing, the determination rule used in the analyzing processing, and addition 
of tags in the converting processing, the information of which is determined on the basis of the 
template or the description rule used upon describing the electronic document 4 in HTML. The 
electronic document printing system 1 stores the analysis defining information 50 predetermined 
by the structural characteristic of the HTML file expected to download. 

[0020] The cover-file generating unit 21 generates the cover file 62 from the file for the top page 
in the HTML files 51 downloaded by the file obtaining unit 10. The front cover of a document 6 is 
printed based on the cover file 52 in the printing processing. The analyzing and converting unit 
22 analyzes the description of each file of the HTML files 51 on the basis of the analysis defining 
information 50 and converts the HTML files 51 so that the HTML files 51 be suitable for the 
printing onto sheet material. Specifically, the analyzing and converting unit 22 analyzes the 
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description of each file of the HTML files 51 and downloads a HTML file or an image file referred 
in a body text of the document. Further, the analyzing and converting unit 22 deletes information 
unnecessary in the printed document 6. 

[0021] Furthermore, the analyzing and converting unit 22 analyzes a structure of contents of the 
electronic document 4 based on the description of the HTML files 51 . Then, the analyzing and 
converting unit 22 divides one file of the HTML files 51 into such an item unit like "chapter", 
"section" and "unit" of the document structure and stores the converted files 53 in the electronic 
document printing system 1 through the analyzing and converting processing. 
[0022] Further, based on the result of the analyzing processing, the analyzing and converting 
unit 22 stores file-corresponding information 54, which is a corresponding relationship between a 
hierarchical structure of the converted files 53 divided according to the structure of the electronic 
document 4 and the converted files 53 used for obtaining each title of items in the table of 
structured contents. The contents-structure generating unit 23 forms a structured-contents 
structure 55 used for managing the table of structured contents displayed by printing processing 
unit 40 on the basis of the analyzing result of the analyzing and converting unit 22. The 
structured-contents structure 55 is information on the structure of the electronic document 4. 
namely, a hierarchical relationship of the converted files 53. 

[0023] The printing processing unit 40 prints the contents of the HTML files 51 as a real printed 
material having the proper style of a booklet or a book. Further, the printing processing unit 40 
has a structured-contents displaying unit 41 , a document displaying unit 42, a printing item 
obtaining unit 43, a printing unit 44, and a previewing unit 45. 

[0024] The structured-contents displaying unit 41 displays the table of structured contents 
indicating the hierarchical relationship with the converted files 53 in tree structure on the Web 
viewer (not shown in FIG.1). In the table of structured contents, each file of the converted files 
53 corresponds to the item unit such as "chapter, section, and item" of the document structure of 
the electronic document 4. The structured-contents displaying unit 41 dynamically controls the 
display or not-display status of the item of the table of structured contents by such a user's 
operation like a click. The structured-contents displaying unit 41 can additionally displays the 
icon set by the predetermined file category of the file of the converted files 53 corresponding to 
the item of the table of structured contents. 

[0025] The document displaying unit 42 displays, on the Web viewer, the content of the file 
determined as the head item of the table of structured contents or the content of the converted 
files 53 corresponding to the item selected of the table of structured contents. The printing item 
obtaining unit 43 obtains the item selected as a target of printing in the table of structured 
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contents displayed on the Web viewer. The printing unit 44 prints the item obtained by the 
printing item obtaining unit 43 onto sheet material having the proper style of a booklet or a book 
as the document 6. The previewing unit 45 displays the contents of the item of the document 6 
set for printing. 

[0026] The general description of processing of the electronic document printing system 1 is 
given below. 

[0027] It is assumed that the electronic document 4 downloaded by the file obtaining unit 10 in 
the electronic document printing system 1 is an online manual of a personal computer, which 
manual is described in HTML. FIG. 2 shows an example of a display screen 100 of a Web 
viewer for displaying one page as a target of printing in the electronic document 4. FIG. 3 to FIG. 
5 show examples of parts of source of the HTML file of a body text of the page displayed on the 
display screen 100, The parts of source shown in FIG. 3 to FIG. 5 are of one HTML file, FIG. 3 
shows a head part, FIG. 4 shows an intermediate part, and FIG. 5 shows the end part. 
[0028] Generally, as shown on the display screen 100, the Web page in the electronic document 
4 is displayed on the Web viewer based on the description of the HTML file interpreted for 
display and comprised of a contents 110 indicating the table of contents and a body text 120 
indicating the contents of the electronic document 4. The page as the body text 120 is included 
a navigation button 130 as a part of the electronic document 4 and the contents of the HTML file 
for display, and the contents of body text 120 is combined and obtained from plural classes of 
the structure of the document for displaying. As indicated by the contents 11 0, an item "1 . Name 
and function of unit" in the body text 120 denotes the content belonging to the class direct below 
"Chapter 1 : Name of unit" of the first class, namely, the item in the second class (section). 
Further, an item "personal computer main body (hereinafter, referred to as a PC main body)" and 
an image 140 of the PC main body in the body text 120 denote the contents belonging to the 
class direct below "1 . Name and function of unit", namely, the items in the third class (item). 
[0029] The file analyzing and converting unit 20 analyzes and converts each file of the 
downloaded HTML files 51 so as not to print the Web page of the online manual as exactly 
shown on the display screen 1 00 but to print the page in a appropriate style for a real printed 
material (a manual). That is, the file analyzing and converting unit 20 generates a cover file 52 
for printing the front cover of the document 6 to print the electronic document 4 as a proper 
styled printed material. Then, the file analyzing and converting unit 20 deletes unnecessary 
information such as the navigation button 130 incorporated from the content in the page of the 
body text 120. Further, the file analyzing and converting unit 20 divides the HTML files 51 
corresponding to the body text 120 into such an item as "chapter, section, or item" in the 
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document structure on the item unit basis, and generates the converted files 53 from the divided 
items. Then, the file analyzing and converting unit 20 stores, as file-corresponding information 
54, information about a hierarchical relationship among the divided and generated classes of the 
converted files 53 as a structured-contents structure 55. Specifically, the file analyzing and 
converting unit 20 analyzes that the description of the HTML file 51 includes "1 . Name and 
function of unit (Chapter 1 , Section 1)" as the item in the second class (section) forming 
"Chapter 1 : Name of unit" in the first class (chapter) and "front of personal computer main body 
(hereinafter, referred to as front of PC main body) (Chapter 1 , Section 1 , Item 1)" as the under- 
item in the third class (item). Then, the file analyzing and converting unit 20 divides the HTML file 
51 into the items based on the analyzing result, and generates two converted files 53. 
[0030] FIG. 6 to FIG. 8 show the source of the converted files 53. FIG. 6 shows the source of 
the file of "1 . Name and function of unit" divided from the HTML file as items of the second class 
(section) shown in FIG. 3 to FIG 5. FIG. 7 and FIG. 8 show the source of the file of "front of PC 
main body" as the item in the third class (item) divided from the HTML file 51 . 
[0031] FIG. 9 shows an example of the display screen 200 displaying the table of structured 
contents and the page based on the converted files 53 subjected to the analyzing and converting 
processing. On the display screen 200 are displayed a structured contents 210 indicating, in 
tree structure, the table of structured contents of the electronic document 4 and a body text 220 
indicating the content based on the converted files 53, which is obtained by deleting the 
information unnecessary for the printing and by dividing it on the item unit basis. Referring to 
FIG. 9, one of the converted files 53 in the page displayed on the body text 220 is the file 
indicating "1. Name and function of unit" in the items of the second class (secfion). Thus, as 
shown the body text 120 in FIG. 2, the content "front of PC main body" in the third class (item) 
and the image 140 in the PC main body are not displayed simultaneously with the content "1 . 
Name and function of unit". 

[0032] The user can select the items for printing from the table of structured contents displayed 
on the structured contents 210 of the display screen 200. The printing processing unit 40 
combines the cover file 52 and the converted files 53 corresponding to the selected items into 
one file for printing job, and prints the file thereof. FIG. 10 shows the image of the printed 
material of the document 6 as a printing result. Thus, the electronic document printing system 1 
can obtain the document 6 as the real printed material having the appearance of the booklet as 
shown the image in FIG. 10 from the electronic document 4 described as the HTML files 51 
premised on displaying on the Web viewer as shown on the display screen in FIG. 2. 
[0033] Here, a detailed description of processing of various means in the electronic document 
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printing system 1 is given. FIG. 11 shows a flowchart for the file obtaining processing. The 
downloading unit 12 accesses a URL inputted by user's operation or other programs' execution, 
then downloads the HTML file having the description of the table of contents of the electronic 
document 4, for example the "index.html" file (step SI). The downloaded-file determining unit 11 
generates a list of the HTML files as targets of downloading from the downloaded "index.html" 
file, and calculates the total number of files to be downloaded (step S2). Specifically, the 
downloaded-file determining unit 11 extracts the URL which is defined by <href> from a first <hr> 
tag to a last <hr> tag in the "index.html" file, and generates the list of the HTML file to be 
downloaded based on the extracted URL. When the same URL is described a plurality of times 
and is has been overlapped to the URL registered to the list, it is ignored. 
[0034] FIG. 12 to FIG. 14 show parts of the source of one loaded "index.html" file. FIG. 12 to 
FIG. 14 show the source of a continuous portion from the head portion of the "index.html" file. 
The URLs defined by <href> existing from a portion shown by a downward arrow in FIG 12 to a 
portion shown by an upward arrow in FIG. 14 are URLs of the HTML file as the description of the 
items forming the table of structured contents. The downloaded-file determining unit 11 registers 
the URLs to the list of the downloaded file. The downloading unit 12 downloads the HTML file 
constituting the electronic document 4 based on the list of the downloaded file, and stores the 
downloaded files as the HTML files 51 (step S3). 

[0035] FIG. 15 shows a flowchart for the file analyzing and converting processing. The file 
analyzing and converting unit 20 obtains the URLs of the files downloaded from the file obtaining 
unit 10 and a path to a directory storing the HTML files 51. The analyzing and converting unit 22 
determines whether or not the HTML files 51 to be analyzed exists (step S11). When the HTML 
files 51 to be analyzed exists, the processing sequence after step SI 2 is performed. The 
analyzing and converting unit 22 downloads the HTML file and such a file as an image file 
defined in the HTML files 51 and stores the obtained files (step SI 2). Specifically, the analyzing 
and converting unit 22 refers to all the HTML files 51 in the downloading order, downloads the 
HTML file based on the URL defined by <href> from the first <hr> tag to the last <hr> tag in the 
HTML file, and further downloads the image file defined by an <img> tag. The analyzing and 
converting unit 22 does not download the same file as the downloaded file to avoid duplicated 
storing. 

[0036] FIG. 16 shows a part of the source of the HTML files 51 downloaded by the file obtaining 
unit 1 0. For example, the HTML file is downloaded with the URL defined based on the <href> 
tag existing between a downward arrow and an upward arrow shown in FIG. 16. 
[0037] The analyzing and converting unit 22 analyzes the description of the HTML files 51 
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based on the analysis defining information 50, deletes the unnecessary information, divides the 
description of the file into each unit of "chapter, section, or item" forming the document structure 
as the converting processing. Then, the analyzing and converting unit 22 generates and stores 
the converted file and writes the analyzing result as the file-corresponding information 54 (step 
S13). For example, the analyzing and converting unit 22 deletes the information described 
before the existing position of the first <hr> tag and the information described after the existing 
position of the last <hr> tag in the HTML files 51 in progress of the analysis. The analyzing and 
converting unit 22 deletes the <href> tag in the HTML files 51 . Further, when the HTML files 51 
includes <h1>, <h2>, and <h3> tags, the analyzing and converting unit 22 generates one file by 
the information described before the existing position of these tag, and further generates another 
file by the information described sequent thereto. 

[0038] Here, a file name comprising "number string of 12 figures" and a ".html" extension are set 
to the converted files 53. The 12-figure portion in the file name is equally divided into four 
portions by three figures ("000111222333.htmr), and segments the four portions into a first 
class, a second class, a third class, and fourth class, starting from the head. For example, the 
file name is as follow: 

1) The name of the file of "Chapter 1" is "OOlOOOOOOOOO.html", and the file thereof 
belongs to the first class, 

2) The name of the file of "Chapter 2" is "002000000000.html", and the file thereof 
belongs to the first class, 

3) The name of the file of "Chapter 2, Section 1" is "002001000000.html", and the file 
thereof belongs to the second class, 

4) The name of the file of "Chapter 2, Section 2" is "002002000000.html". and the file 
thereof belongs to the second class, 

5) The name of the file of "Chapter 2, Section 2. Item 1" is "002002001 000. html", and the 
file thereof belongs to the third class, and 

6) The name of the file of "Chapter 3" is "003000000000. html" and the file thereof 
belongs to the first class. 

[0039] The URLs (file names) of the HTML file are registered in the structured order to the file- 
corresponding information 54 so as to obtain the titles indicating the items in the table of 
structured contents. 

[0040] Next, the cover-file generating unit 21 generates the cover file (HTML file) 52 based on 
data in the downloaded HTML files 51 by the downloading unit 12 (step SI 4). For example, the 
cover-file generating unit 21 extracts the manual name, the manual code, and information on the 
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copyright notice to be described on the front cover, from the HTIVIL files 51 , for example a 
"top.html" file, forming the top page in the electronic document 4, and generates and stores the 
cover file 52. 

[0041] Specifically, the cover-file generating unit 21 generates the cover file 52 as follows: 

1) The cover-file generating unit 21 determines a linefeed code and obtains a character 
string, on line unit basis, from the description of the target HTML file 51 such as the "top.html" 
file. 

2) The cover-file generating unit 21 checks whether or not characters other than the 
HTML tag exists in the obtained character string and obtains characters other than the HTML 
tag, 

3) If the above procedure is performed as the first one, the cover-file generating unit 21 
sets the obtained characters (character string) other than the HTML tag as "manual name", 

4) If the above procedure is performed as the second one, the cover-file generating unit 
21 sets the obtained characters (character string) other than the HTML tag as "manual code", 

5) The cover-file generating unit 21 checks whether or not the obtained character string 
includes characters such as "copyright", and then, in case the characters of "copyright" exists in 
the obtained characters, sets the obtained characters (character string) as the copyright notice, 
and 

6) The cover-file generating unit 21 insert the character strings set as the manual name, 
manual code, and copyright notice into the predetermined template of the front cover sheet as 
the HTML file, thereby generates the cover file 52. 

[0042] The analysis defining information 50 used for the analysis and conversion processing is 
information defining the tag as the targets of analyzing or adding in the converted files 53. 

FIG. 17 shows examples of the tag as a target of analyzing which is defined by the 
analysis defining information. FIG. 1 8 shows examples of the tag added in the converting 
processing. Referring to FIG. 17, <h1>, <h2>, and <h3> tags are used for the determination 
upon dividing the file. The <hr> tag is used for the determination upon deleting the unnecessary 
information. An <href> tag is used for the determination upon downloading the HTML file. An 
<src img> tag is used for the determination upon downloading an image file. A <title> tag is 
used when the title information of the HTML file is necessary. The <title> tag is used upon 
obtaining a character string (title) registered to item in the table of structured contents or upon 
obtaining the character string of the item selected when the front cover page is formed in the 
printing processing or the preview processing. 

[0043] An <meta http-equiv= "Content-Type" content="text/html; charset=iso-2022-jp"> tag is 
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inserted into the header portion of the converted file (HTML file) 53 which is generated through 
the analyzing processing. An <STYLE TYPE="te)ct/css">H6{page-break- 
before:always}</STYLE> tag is inserted into the header portion of the HTML file for printing job 
in the printing processing or the preview processing. In case the tag is inserted and an <H6> tag 
is described in the HTML file, the page-break is executed at the position of the tag described. 
The <H6> tag is described the position set the page-break command in the HTML file for printing 
job in the printing processing or the preview processing. 

[0044] Referring to FIG. 18, an <NAME>***</NAME> tag is used in the header portion in the 
cover file 52. A character string indicating a manual name (document name) is described as a 
directive "***" between the tags. A <CODE>***</CODE> tag is used in the header portion in the 
cover file 52. A character string indicating the manual code (document code) is described as a 
directive "***" between the tags. A<COPYRIGHT>***</COPYRIGHT> tag is used in the header 
portion in the cover file 52. A character string indicating the copyright notice is described as a 
directive "***" between the tags. A <K*> </K*> tag indicates which class the HTML file belongs 
to. The tag information is used upon changing the displayed class of the item corresponding to 
the HTML file 51 . For example, a set of <K1 > </K1> tags indicates the first class and a set of 
<K2> </K2> tags indicates the second class. 

[0045] A <BASEFILENAME="***"> tag indicates the file name of the converted files 53 after the 
completion of the analyzing and converting processing. The tag information is used upon 
changing the displayed class, an <SUBTITLE="***"> tag indicates the title of the converted files 
53. The information of this tag is used upon changing the displayed class, a <viewdocs> <K*> 
<BASEFILENAME="***"> <SUBTITLE="***"> </K*> </viewdocs> tag is inserted into the top of 
the body portion in the converted files 53 generated in the analyzing and converting processing. 
[0046] Further, the contents-structure generating unit 23 generates the structured-contents 
structure 55 based on the file-corresponding information 54 (step SI 5). 
[0047] The structured-contents structure 55 is information for managing the configurafion of the 
table of structured contents. FIG. 19 shows examples of modules comprising the structured- 
contents structure 55. The structured-contents structure 55 comprises the following modules. 
An "HTREEITEM hTreeltem" module is a tree handle, for managing the table of structured 
contents, transmitted upon registering an item to the table of structured contents. The tree 
handle is obtained by a return value of an "InsertltemQ" function which is used upon registering 
the item to the table of structured-contents. A "CString strTitle" module indicates the title 
displayed as an item in the table of structured contents. An "int iRoot" module indicates a flag for 
determining whether or not an item belongs to a root class (first class). If the value of the 
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module equals zero (=0), the module indicates the item belongs to a child class, namely, the 
second or subordinated class. If the value of the module equals one (= 1), the module indicates 
the item belongs to the root, namely, the first class. The "int iRootCnt" module indicates the 
registered number of the mother item to the table of structured contents. An "int iDeep" module 
is a flag for determining which class an item belongs to. If the value of the module equals one (= 
1), the module indicates the item belongs to the first class. If the value of the module equals two 
(= 2), the module indicates the item belongs to the second class. If the value of the module 
equals three (= 3), the module indicates the item belongs to the third class. If the value the 
module equals four (= 4), the module indicates the item belongs to the fourth class. A "CString 
strPath" module indicates path information of the registered converted-file 53. A"bool 
bCheckOn" module is a flag for determining whether or not the item is selected on the table of 
structured contents. If the value of the module is TRUE, module indicates the check box of the 
table of structured contents is ON and the item selected. If the value of the module is FALSE, 
the module indicated the check box of the table of structured contents is OFF and the item not- 
selected. 

[0048] The contents-structure generating unit 23 obtains information on class relationship 
between the files based on the file name having 12-figure numbers assigned to each file of 
converted file 53, and sets the obtained information to the structured-contents structure 55. 
[0049] FIG. 20 shows a flowchart for the structured-contents display processing. The 
structured-contents display processing is executed by the structured-contents displaying unit 41 
in the printing processing unit 40. The structured-contents displaying unit 41 displays the table 
of structured contents based on the information set to the structured-contents structure 55 (step 
S21). Specifically, the structured-contents displaying unit 41 extracts the item of the first class 
from the structured-contents structure 55, obtains the file name of the converted files 53 
corresponding to the extracted item from the file-corresponding information 54. Then, the 
structured-contents displaying unit 41 extracts the character string (title) of the <TITLE> tag in 
the obtained converted-file 53, and displays, in tree structure, the title extracted on the structured 
contents 210 of the display screen 200. Thus, the title of each chapter as the item of the first 
class is displayed as a default of the table of structured contents. 

[0050] The structured-contents displaying unit 41 displays the title of the table of structured 
contents and the icon corresponding to the type of item description. For example, the 
structured-contents displaying unit 41 calculates the appearance ratio of tags of the same kind 
from the first <hr> tag to the last <hr> tag in the converted files 53 corresponding to the item to 
be displayed. If a predetermined tag appears by a ratio of 90% or more, the icon representing to 



24 



^ c • 

Serial No. 10/685,493 

the tag is displayed. 

[0051] FIG. 21 shows examples of the icon. Referring to FIG. 21 , if the <table> tag appears in 
the converted files 53 at a predetermined ratio, a table icon Is displayed on the item of the table 
of structured contents. If the <img> tag appears in the converted files 53 at a predetermined 
ratio, an image icon is displayed. If the <a> tag appears in the converted files 53 at a 
predetermined ratio, a link icon is displayed. If the three tags do not satisfy the predetermined 
appearance ratio, a book icon is displayed. 

[0052] The document displaying unit 42 displays, on the body text 220 on the display screen 
200, the description of the converted files 63 (HTML file) registered to the head item of the table 
of structured contents (step S22). 

[0053] FIG. 22 shows a flowchart for the printing processing. The printing processing is 
executed by the printing item obtaining unit 43 and the printing unit 44 in the printing processing 
unit 40. The printing item obtaining unit 43 obtains the item selected from the table of structured 
contents and the setting of presence or absence of the cover printing (step S31). The printing 
item is selected by user's marking with the check box of the item displayed on the structured 
contents 210 of the display screen 200. The presence or absence of the cover printing is 
determined by selecting the menu item for the cover printing operation set in the operation 
designating menu portion (menu bar) or the check box set in the cover printing operation window 
(not shown) on the display screen 200. Then, the user selects the menu item or marks with a 
check on the check box to set an indication of the printing the front cover. The printing item 
obtaining unit 43 determines whether the front cover is present or absent (step S32). If 
"presence of the front cover" is determined in step S32, the cover generating processing is 
performed (step S33). 

[0054] FIG. 23 shows a flowchart for the cover generating processing. The printing unit 44 
generates the file for printing job in a work area of a memory unit, and copies the description of 
the template of the cover file (step S40). Further, the printing unit 44 determines whether or not 
the item exists in the table of structured contents based on the structured-contents structure 55 
(step S41). When the item exists, the items of the table of structured contents are fetched one 
by one and determined whether or not the item is selected (step S42). If the item of the table of 
stnjctured contents is selected by referring to the value of the "bCheckOn" flag, the printing unit 
44 generates a temporary structure in the work area of the memory unit. Then, the printing unit 
44 determines whether or not a counter value of the total number of titles stored in the temporary 
structure is not more than a predetermined number (for example the value of 15) (step S43). 
[0055] When the counter value of the total of titles stored in the temporary structure is not more 
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than the predetermined number, the printing unit 44 obtains and stores the title of the item 
selected on the table of structured contents into the temporary structure. After that, the counter 
value of the total number of titles is counted up (step S44). Specifically, the printing unit 44 
obtains the file name of the converted files 53 corresponding to the extracted item from the file- 
corresponding information 54. Then, the printing unit 44 extracts and stores the character string 
(title) of the <title> tag in the obtained converted file 53 into the temporary structure, and 
increments the counter value of the total number of titles by one. Upon checking the class of the 
selected item and determining that the items belong to plural classes, the printing unit 44 stores 
only the title of the item of the upper class. Further, the titles of all the selected items may 
sequentially be stored. 

[0056] If the table of structured contents includes the selected item which is not processed yet, 
the printing unit 44 performs the processing in steps S43 and S44 during the term the counter 
value of the total number of titles does not reach the predetermined number. After the counter 
value of the total number of titles reaches the predetermined number, the printing unit 44 stores 
the character string "others... " at the end of the titles in the temporary structure (step S45). 
After that, the printing unit 44 determines the character string from the cover file 52 by linefeed 
code and obtains it by one line unit (step S46) and determines whether or not the <a 
name="item"> tag exists in the obtained character string (step S48) through the end of the file 
(step S47), When the <a name="item"> tag exists in the obtained character string, the printing 
unit 44 inserts the title of the item stored in the temporary structure into the file for printing job 
(step S4g). 

[0057] When the <a name="item"> tag does not exist in the character string, the printing unit 44 
obtains the character string by one-line unit (step S46), and repeats the processing in steps S46 
and S49 through the end of the file (step S47). 

[0058] After performing the cover generating processing (FIG. 22: step S33). the printing unit 44 
extracts the converted files 53 corresponding to the selected item in the order set in the file- 
corresponding information 54, and combines the converted files 53 in the order for extraction to 
the file for printing job (step S34). It is determined whether the presence or absence of the front 
cover upon printing (step S32). If "the absence of a front cover" is determined, the printing unit 
44 extracts the converted files 53 corresponding to the selected item in the order set in the file- 
corresponding information 54 and combines the converted files 53 into the file for printing job in 
the extracting order (step S35). 

[0059] The printing unit 44 adds a header part, a footer part or both of them and the page 
number to each pages except the front cover of the file for printing job, and then prints the file for 
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printing job onto sheets of paper so as to output tlie document 6 (step S36). For example, the 
printing unit 44 inserts the character strings of the title of the online manual and each title of 
items to the header part and the page number and the copyright notice to the footer part of the 
page for printing. 

[0060] FIG. 24 shows a flowchart for the preview processing. The preview processing is 
executed by the previewing unit 45 in the printing processing unit 40. The previewing unit 45 
determines whether or not the preview operation is selected (step S51). If the preview operation 
is selected, the previewing unit 45 obtains the file for printing job from the printing unit 44 (step 
S52). Then, the previewing unit 45 add the header part, the footer part, or both of them, and the 
page number to each page except the front cover of the file for printing job, and displays the 
contents of the file on a preview screen (not shown) (step S53). The previewing unit 45 closes 
and ends the preview screen when either return or end button is selected by click on the preview 
screen (step S54). 

[0061] According to the present invention, the processing unit, functions, or components are 
realized as a processing program which is read and is executed by a computer. Further, the 
processing program according to the present invention is stored in a proper recording medium 
which are read by a computer, such as a movable medium memory, semiconductor memory, or 
hard disk. The processing program is recorded to these recording medium and is provided. 
Alternatively, the processing program is provided by the reception and transmission using 
various communication networks via a communication interface. 

[0062] FIG. 25 shows examples of the functions (modules) used by the processing program 
when the electronic document printing system is realized by the computer-readable processing 
program. The description of the functions are given below: 

1) "OnSearchO" of "FindKyeword.cpp" is a function for starting a search processing, 

2) "MySetTreeViewStatusO" of "HTMLPrint.cpp" is a function for storing a value in structured 
contents structure 55 and 

"MyUnionHtmlFileO" is a function for combining the HTML files, 

3) "OnOptKail ()" of "HTMLPrintDoc.cpp" is a function for selecting "Display up to the first class", 
"OnOptKai2()" is a function for selecting "Display up to the second class", "OnOptKai3()" is a 
function for selecting "Display up to the third class", "OnOptKai4()" is a function for selecting 
"Display up to the fourth class", "OnFileOpenWebQ" is a function for selecting "Open Web 
manual" in the file menu and "OnFileOpenManu" is a function for selecting "Open Manual" in the 
file menu, 

4) "SetTreeViewStatusO" of "MainTreeView.cpp" is a function for selecting an item of the table of 
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structured contents, "OnSelchangedQ" is a function for displaying the file of the selected item, 
"OnCheckOnAIIQ" is a function for selecting "check all items" in the pop up menu, 
"OnCheckOffAIIO" is a function for selecting "reset all items" in the pop up menu, 
"OnCheckOnUnderO" is a function for selecting "check all items under classes" in the pop up 
menu, 

"OnCheckOffUnderO" is a function for selecting "reset all items" in the pop up menu, 
"OnCheckOnUnderPageO" is a function for selecting "check all subsequent pages" in the pop up 
menu, "OnCheckOfFUnderPageQ" is a function for selecting "reset all subsequent pages" in the 
pop up menu, "GetCheckStatusQ" is a function for determining on or off of check boxes in the 
entire table of structure contents, "MySetCheckStatusQ" is a function for determining on or off of 
check box and setting a flag and "MySelectitemQ" is a function for setting designated item in the 
table of structured contents to selected state, and 

5) "SetPrintPreviewQ" of "MainWebViewxpp" is a function for executing printing or previewing 
processing. 

[0063] According to the present invention, the user obtains an electronic document, for example 
a downloaded online help file or online manual file, as printed material having proper style of a 
booklet or a book, in irrelevant to the status of connection with the Internet. That is, according to 
the present invention, the user obtains a downloaded electronic document as printed material 
when the continuous access to the Internet or the dialup access is impossible. Further, the user 
can browse, by the Web viewer, the contents of the electronic document selected as a target for 
printing into the printed material and confirms the contents of the file to be printed. 
[0064] Furthermore, according to the present invention, the description of the files forming the 
electronic document are analyzed and converted to a description suitable for the printing as 
sheet material like paper. Thus, as compared with the simple continuous-printing of the HTML 
file of the electronic document formed for displaying, the description of the document to be 
printed is compressed and the number of printed sheets is reduced. 

[0065] Furthermore, according to the present invention, the user arbitrarily selects only the item 
for printing from the table of contents indicating the items generating the electronic document in 
tree structure, and obtains the document including only selected item as printed material with the 
proper form of booklet or a book. 

[0066] Although the present invention is described according to the embodiment, it can variously 
be modified within the essentials of the present invention. According to the above-described 
embodiment of the present invention, the electronic document printing system 1 has the file 
obtaining unit 10 including the downloading function via the network 2. However, the file 
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obtaining unit 10 in the electronic document printing system 1 does not have the downloading 
function, and can read the HTML files 51 stored in an external storage device and can perform 
the file analyzing and converting processing and the printing processing. 
[0067] Further, according to the above-described embodiments of the present invention, the 
electronic document printing system 1 includes one pieces of the analysis defining information. 
However, the electronic document printing system 1 can have a plurality of pieces of the analysis 
defining information which define information previously determined by the description rule or the 
template used in the case of describing in the markup language, the electronic document 
corresponding to the various electronic documents as the target of the printing processing. 
[0068] The entire disclosure of Japanese Patent Application No. 2003-305028 filed on October 
18, 2002 including specification, claims, drawings and summary are incorporated herein by 
reference in its entirety. 
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